Locating Boundaries for Prosodic Constituents in Unrestricted Mandarin Texts

Authors

  • Min Chu
  • Yao Qian
Abstract

This paper proposes a three-tier prosodic hierarchy for Mandarin, comprising prosodic word, intermediate phrase and intonational phrase tiers, that emphasizes the use of the prosodic word instead of the lexical word as the basic prosodic unit. Both the surface difference and the perceptual difference show that this is helpful for achieving high naturalness in text-to-speech conversion. Three approaches, the basic CART approach, the bottom-up hierarchical approach and the modified hierarchical approach, are presented for locating the boundaries of the three prosodic constituents in unrestricted Mandarin texts. Two sets of features are used in the basic CART method: one contains syntactic phrasal information and the other does not. The set with syntactic phrasal information results in about a 1% increase in accuracy and an 11% decrease in error cost. The modified hierarchical method produces the highest accuracy, 83%, and the lowest error cost when no syntactic phrasal information is provided. It shows advantages in detecting the boundaries of intonational phrases at locations without breaking punctuation. 71.1% precision and 52.4% recall are achieved. Experiments on acceptability reveal that only 26% of the mis-assigned break indices are truly infelicitous errors, and that the perceptual difference between the automatically assigned break indices and the manually annotated break indices is small.
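
As an illustration of how precision and recall figures like those quoted in the abstract are typically computed for a boundary-detection task, here is a minimal sketch. It is not the authors' evaluation code. The integer label scheme (0 for no boundary, 1 for prosodic word, 2 for intermediate phrase, 3 for intonational phrase), the function name `boundary_precision_recall` and the toy sequences are all assumptions made for this example.

```python
def boundary_precision_recall(reference, predicted, target):
    """Precision and recall for one break-index label.

    `reference` and `predicted` are equal-length sequences of break
    indices, one per juncture between adjacent words or syllables.
    The label values are a hypothetical encoding, not the paper's.
    """
    if len(reference) != len(predicted):
        raise ValueError("sequences must be aligned juncture by juncture")

    # Junctures where both the annotator and the system assign `target`.
    true_pos = sum(
        1 for r, p in zip(reference, predicted) if r == target and p == target
    )
    pred_pos = sum(1 for p in predicted if p == target)  # system says `target`
    ref_pos = sum(1 for r in reference if r == target)   # annotator says `target`

    precision = true_pos / pred_pos if pred_pos else 0.0
    recall = true_pos / ref_pos if ref_pos else 0.0
    return precision, recall


# Toy data, invented for illustration only.
reference = [0, 1, 2, 1, 3, 0, 1, 3, 2, 3]
predicted = [0, 1, 2, 1, 3, 0, 3, 2, 2, 3]

precision, recall = boundary_precision_recall(reference, predicted, target=3)
print(f"precision={precision:.3f} recall={recall:.3f}")
```

Scoring a single label at a time, as above, separates the harder intonational-phrase decision from overall accuracy, which is dominated by the more frequent lower-level boundaries.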

Similar resources

Linguistic Patterns Detected Through a Prosodic Segmentation in Spontaneous Taiwan Mandarin Speech

This paper proposes that spontaneous speech, segmented into perceptually coherent prosodic constituents, is able to provide plentiful linguistic information in which clear patterns can be observed. We present pioneering studies with empirical and quantitative evidence, supporting the notion that prosodic units can be useful for the automatic processing of spontaneous speech. High inter-labelers...

Features of Prosodic Hierarchy Boundary between Uygur and Mandarin Chinese

The study explored the perception and acoustic features of prosodic hierarchy boundaries in Uygur and Mandarin Chinese. The results of the perception study showed that native speakers of Mandarin Chinese were more sensitive to prosodic hierarchy boundaries than native speakers of Uygur. The results of a detailed acoustic analysis indicated that the acoustic features of Uygur prosodic hierarchy ...

Unsupervised joint prosody labeling and modeling for Mandarin speech.

An unsupervised joint prosody labeling and modeling method for Mandarin speech is proposed, a new scheme intended to construct statistical prosodic models and to label prosodic tags consistently for Mandarin speech. Two types of prosodic tags are determined by four prosodic models designed to illustrate the hierarchy of Mandarin prosody: the break of a syllable juncture to demarcate prosodic co...

Comparison of Perceived Prosodic Boundaries and Global Characteristics of Voice Fundamental Frequency Contours in Mandarin Speech

Although there have been many studies on the prosodic structure of spoken Mandarin as well as many proposals for labeling the prosody of spoken Mandarin, the labeling of prosodic boundaries in all the existing annotation systems relies on auditory perception, and lacks a direct relation to the acoustic process of prosody generation. Besides, perception-based annotation cannot ensure a high degr...

Prosodic marking of topic constructions in Mandarin Chinese

This study examines the prosodic marking of topic constructions in Mandarin Chinese. The findings suggest that, even though on the surface there are prosodic markings that differentiate topics from comments, the difference is the product of the prosodic phrasing, coupled with the declination and final lowering, in that a topic construction is usually decomposed into two prosodic constituents...

Journal title:
  • IJCLCLP

Volume 6

Publication date 2001